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Because diffusion typically involves symmetric interactions, scant attention has been focused on 
studying asymmetric cases. However, important networked systems underlain by diffusion (e.g. 
cortical networks and WWW) are inherently directed. In the case of undirected diffusion, it can be 
shown that the steady-state probability of the random walk dynamics is fully correlated with the 
degree, which is no longer true for directed networks. We investigate the relationship between such 
probability and the inward node degree in modular networks, which we call efficiency. Our findings 
show that the efficiency of a given community depends mostly on the balance between its ingoing and 
outgoing connections. In addition, we derive analytical expressions to show that neither the internal 
degree nor the communities topology play a crucial role in their efficiency. The results are illustrated 
with respect to the macaque cortical network, providing subsidies for improving transportation and 
communications systems. 



I. INTRODUCTION 

Diffusion is one of the most fundamental dynamics 
in physics. In addition to being ubiquitous, diffusion 
also underlies important non-linear systems such as Tur- 
ing reaction-diffusion, Schrodinger, Fokker-Planck, and 
Navier-Stokes equations. Because these systems typically 
present symmetric interactions, few works have addressed 
the asymmetric counterpart. However, important net- 
worked systems in biology (e.g. cortical and metabolic 
networks), transportation (e.g. city traffic), and commu- 
nications (e.g. WWW and email networks) are typically 
directed and underlain by diffusion [T]. A particularly 
important property of such systems is how the asymp- 
totic probability of the random walk dynamics (activa- 
tion) can be predicted from intrinsic topological features. 
For instance, it is known that the activation is fully cor- 
related with the node degree for undirected networks [5] . 
However, this is no longer ensured in the case of diffu- 
sion in directed networks [3]. The present work reports 
an investigation on how the balance of directed edges in 
modular networks defines how activation increases with 
the inward node degree inside each community, a con- 
cept that we call efficiency. In particular, we develop an 
analytical model, showing that neither the internal de- 
gree nor the community topology play a crucial role in 
defining the efficiency. Though general, our findings are 
illustrated with respect to the macaque cortical network 

There are many studies concerning diffusion over undi- 
rected networks [6] , but the most recognized property 
of this dynamics is that its steady-state probability is 
completely defined by the number of neighbors (or de- 
gree) ki of a given node i [6 . To illustrate this, we con- 
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sider a random walk taking place on the network. If the 
walker is at node j at time t — 1, we can write the prob- 
ability of finding this agent at node i at time t as 
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where kj denotes the degree of node j, is the total 
number of nodes, and Aij is the element (i,j) of the 
adjacency matrix A. After a long period of time, the 
system is guaranteed to reach equilibrium [7j and we have 
•p,(i) = 'p^{t - 1) = V°° . In order to find P~ we can 
use the detailed balance condition V°°kj = Vfki [6], to 
obtain [5] 
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which we henceforth call the activity of node i. 

In the case of directed networks, the relation between 
the activity and the degree of a given node is not linear 
and remains not completely understood. In this work 
we focus on the study of some interesting properties that 
emerge when considering the direction of the edges. For- 
tunato et al. [5] studied the problem of searching in di- 
rected networks and found an expression for the PageR- 
ank dynamics, which is basically a random walk process 
with additional random jumps occurring with probability 
q. In the limit q — >■ 0, the PageRank reduces to our defi- 
nition for the activity of a node in a strongly connected 
network, that is. 
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where k™ is the number of connections that point to a 
node, called inward degree. 
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An example of directed network where this relation 
can be studied is the cortical network of the Macaque 
of the family Cercopithecidae [1] shown in Figure [ija) . 
In this network, each node corresponds to a cortical re- 
gion and the directed edges correspond to corticocortical 
tracts connecting these regions. This network has been 
largely studied in order to understand how stimulus in 
specific regions of the cortex can induce activation in 
other regions. Concerning the topological organization 
of this cortical network, we observe the presence of two 
communities [S], which are mostly related to the spatial 
position (anterior and posterior) of the nodes, as seen in 
Figure [ija). The anterior community corresponds to the 
regions responsible for performing high level tasks such 
as sensory integration and planning, while the posterior 
community is related to the primary sensory processing, 
including the cortical visual area in the occipital, tempo- 
ral and parietal lobes [10]. We also found an asymmetry 
of connections between these two communities, with the 
posterior module receiving more connections from the an- 
terior module than the other way around. In Figure [l|b) 
we show Ai in terms of fcj" for every node of this net- 
work, which contradicts the linear dependence between 
these variables suggested by Equation [3] The non- linear 
behavior shown in Figure [ijb) was first observed in Ref. 
[3], and is here verified to be related to the modular or- 
ganization of the cortical network, identified by the black 
and white nodes, which correspond to the posterior and 
anterior communities in Figure [Ija), respectively. 

How does the topology of the cortical network origi- 
nate the observed non-linearity? In order to address this 
question, we first need to consider the probability of the 
walker being at one of the communities. Suppose we have 
a network of TV nodes and n communities. For simplic- 
ity's sake, we also suppose that all communities have the 
same size N/n (the case for different sizes is straightfor- 
ward) . The evolution of the probability P,„ of finding the 
walker at community m, is given by 
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FIG. 1. (a) Largest strongly connected component of the 
Macaque cortical network partitioned in two communities. 
This component has 85 nodes and average degree (k) — 27.7. 
(b) Relationship between the activity and inward degree of 
each node. Nodes in (b) are colored according to the commu- 
nity they belong in (a). 



{Kmi, Kmn} conucctions, will move from community 
m to h. This probability can be expressed as 
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where k™* — X]r=i corresponds to the number 
of outward connections of nodes inside community to. 
Regarding Poissonian networks (e.g. Erdos-Rcnyi ran- 
dom graph). Equation [5] can be exactly solved, but since 
the node degrees of these networks are known to be 
well represented by the mean value of the associated de- 
gree distribution, it is reasonable to make the homoge- 
neous assumption and consider every node as having the 
same number of connections pointing to each community. 
Therefore, every node i inside the community to makes 
Kmh{i) — {f^mh) connections with the community h. Un- 
der this assumption, the probability Pmh is related to the 
mixing parameter [11] , and can be written as 
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Finally, we can write the activity of a node i located 
inside community m as 



where Pmh is the probability of the walker going from 
community to to h. In the equilibrium, dtPm — and 
the probability of the walker being at each community 
is given by the eigenvector tt associated to the unitary 
eigenvalue of the transition matrix T, whose elements are 
equal to Pmh- If we call Kmh the variable associated to 
the number of outward connections that a node inside 
community m has with community h, the elements of 
matrix T can be written as 
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where P(m — h \ Kmi, K,mn) is the conditional prob- 
ability that the walker, being in a node with exactly 
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where, (kJJJ) — X]ILi('^/m)- Equation [s] corresponds 
to the asymptotic probability of the walker being inside 
community to, tt™, times the activity of a node in a di- 
rected network, given by Equation [3] evaluated over the 
nodes inside community m. The coefficient is here 
called efficiency of the community to, since it represents 
the capacity of the community to to generate activity 
through the inward connections of its nodes. 

In order to test the accuracy of Equation |8j we pro- 
pose a model for modular directed network where it 
is possible to control the efficiency of each community. 
First, we define the n x n matrices k and r, that specify 
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FIG. 2. (a) Illustration of how the unbalance between com- 
munities implies in large efficiency differences, (b) invariance 
of the efficiency when the balance is kept and the density of 
community 1 is varied, (c) the same as before for communi- 
ties with different reciprocities. The dashed lines correspond 
to the predictions obtained from Equation [S] The respective 
parameters are shown above each graph. 



the desired internal (diagonal entries) and external (off 
diagonal entries) degrees and reciprocities, respectively. 
The elements of these matrices are given by (Kmh) and 
{fmh), respectively. The parameter (rmh) corresponds 
to the average reciprocity |T2] of the connections be- 
tween community m and h. Next, we create n undirected 
Erdos-Renyi networks with e edges and assign a random 
direction for every edge (j,j). In addition, a recipro- 
cal edge (j, i) is added to the network with a probabil- 
ity (rmm)-, which defines the reciprocity of network to. 
The final average internal degree of this network is then 
{i^mm) = ne{l + {rmm))/N . Finally, the same procedure 
is applied to connect the created networks between them- 
selves, but now considering the off-diagonal elements of 
K and r. 

In Figure [2f^a) we show the communities efhciency for 
the case n = 2> and = 1500 as a function of the aver- 
age number of connections between communities 1 and 
2, (ki2)- For each point in this figure, we created sev- 
eral networks using the proposed model. Through linear 
regression, we estimated the angular coefficient of the 
relation between activity and inward degree, which cor- 
responds to the points of the figure. The dashed lines 
represent the value of expected by Equation[8]for each 
community. It is clear that the homogeneous assumption 
works well and can correctly predict the communities ef- 
ficiency. 

The most striking feature of Equation |8] is that each 
community can have different efficiency. It is possible 
to show that if the communities are balanced, which 
is the case for all undirected networks, the internal de- 
gree cannot change the relative efficiency between com- 
munities. In order to show that, let us consider the 
case where all the communities are balanced and, con- 
sequently, {Kmh) = {Khm) for aU h ^ m, implying 
(C"') = (C)- Therefore, ^.^^ = i(C)/(fc'") is an eigen- 
vector of the transition matrix T, and 
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which does not depend on the community and recov- 
ers the previous result given in Equation [3j This re- 
sult means that denser communities have greater activ- 
ity, but the relation between activity and inward degree 
remains the same for every community. If we try to in- 
crease the activity of community m by adding edges in- 
side it, both its average inward degree (k™) and prob- 
ability will increase in the same proportion, making 
the efficiency nearly constant according to Equation [8] 
The only change comes from the increase in average in- 
ward degree (fc™) of the network, which in general cases 
is hardly affected by the change of only one community. 
We observed that such a behavior is also verified when 
the communities are unbalanced, as shown in Figure[2]^b) , 
or for networks with different reciprocities, as shown in 
Figure [2jc). 

To better understand the effect of unbalancing the con- 
nections between communities, we start with n balanced 
communities, and invert a fraction 1 — e of the outgoing 
connections between the community / and m. This is re- 
peated for every community m, so as to greatly increase 
the activity of /. We need to show that under reasonable 
conditions the efficiency of / after the rewiring process 
(^'f ) is higher than the rest of the network, i.e. C'f /Cm > 1 



for a given to. In order to do so we first observe that the 
new transition probabilities, pj^j/, , are given by 
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where = 1 if / = /i and otherwise. Therefore, 
in the limit e — > 0, the ratio ttj/tt^ for m between 
the elements of the eigenvector tt' of the new transition 
matrix, must diverge. On the other hand, assuming that 
{k^) denotes the number of inward connections of nodes 
inside community m after the inversion process, the in- 
ward degree ratio, given by 



(4") + (1 - e)j:{^fl) ^ 2{Kf) - {Kff) 



(11) 



will remain bounded, making the efficiency ratio be- 
tween communities / and to being necessarily higher than 
one. 

In the case of communities with scale-free topology 
[TT] , it is harder to find a closed expression for the activ- 
ity, but there is nothing indicating that the same effect 
cannot be observed. There is also no standard model 
to generate a modular network with scale-free topology 
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FIG. 3. Efficiency as a function of (K12) in a modular network 
whose communities have scale-free topology. All communities 
have {Kmm) = 6 and (rmm) = 1, Vm, (^13) = (K23) = 0.01 
and {k2i) = ~ (^32) = 0.5. Dashed lines correspond to 

the analytical prediction. 
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FIG. 4. Activity of the cortical network (circles) compared to 
the model (triangles) and the analytical prediction (dashed 
lines). The average degrees of the network are (kh) = 25.95, 
{K22) = 23.91, {K12) = 1.21 and {«:2i) = 4.35. The mean 
reciprocities are (rn) = 0.714, {r22) = 0.624 and (ri2) = 
{7-21) = 0.51. The communities have 42 and 43 nodes. 



in which the balance of connections between the com- 
munities can be systematically controlled. Nevertheless, 
we can imagine a very simple model where n Barabasi- 
Albert networks |13j are created and randomly connected 
in the same manner as in our previously presented model. 
This is not a truly scale- free network, but it can be shown 
that Pmh is given as 
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where a is a normalization factor. This expression 
should work well if the external degree of the nodes is 
smaller than the minimum degree of the BA networks. 
In Figure [3] we see that this is exactly what happens, 
in the sense that, equation 12 and the simulated results 
are in a good agreement for small values of (K12), but 
deviates as (K12) increases. We also see in Figure [3] the 
same variation in efficiency caused by the balance be- 
tween communities, which confirms that such behavior is 
topology independent. 

Therefore, we obtained a reasonable model to explain 
the non-linearity observed in the relation between activ- 
ity and inward degree in the cortical network. We found 
all the relevant parameters of each community previously 
detected in the cortical network, which were considered 
to perform a single realization of our model with Poisso- 
nian distribution. The result is shown in Figure [4j where 
we can see that the model originates a two-group activity 
that is very similar to that found for the cortical network. 
In addition, since the network has only two communities, 
we can write Equation |8] in a closed form for community 
1 as 
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with a similar expression holding for the second com- 
munity. The predicted values of activity using Equation 
13 are shown in Figure[4] where we see a good agreement 
between the analytical and original activity of the given 
cortical network. The small difference likely comes from 
the nontrivial degree distribution and correlations in the 
real network which were not taken into account in our 
model. 

The loss of correlation in directed networks has been 
an intriguing phenomenon. We have shown in this arti- 
cle that it is intrinsically associated to the unbalance of 
directed connections between communities in a network. 
We also found that the communities topology and edges 
reciprocity do not significantly affect their respective effi- 
ciency. The derived equations provide the basis not only 
for better understanding the effect of modular structure 
in diffusion dynamics, but also for planning strategies for 
increasing or decreasing the number of accesses to nodes 
in real-world networks. For example, in the context of 
the WWW, if we want to increase the activity inside a 
given module, it may be more efficient to add connections 
coming from other modules than to increase the density 
of connections inside the module. The implications are 
particularly relevant for neuroscience, where the systems 
are almost invariantly modular and directed. It would 
also be interesting to study the efficiency along topolog- 
ical scales, as well as to extend the results reported in 
this work to other types of dynamics such as integrate 
and fire and synchronization. 
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